CU VOCAL Web Service: A Text-to-speech Synthesis Web Service for Voice-enabled Web-mediated Applications
نویسندگان
چکیده
This paper presents the implementation of the CU VOCAL Web service, one of the first Chinese text-to-speech synthesis Web services. The CU VOCAL Web service can be easily integrated with other Web services to develop innovative Web-mediated applications. We have developed a novel automatic voice alert system in the stocks domain by integrating CU VOCAL and several other Web services. This system can monitor a real-time financial information feed for alert conditions pre-specified in the user’s personalized profile, and trigger synthesized spoken messages to alert the user via the (mobile) telephone.
منابع مشابه
Recent enhancements in CU VOCAL for Chinese TTS-enabled applications
CU VOCAL is a Cantonese text-to-speech (TTS) engine. We use a syllable-based concatenative synthesis approach to generate intelligible and natural synthesized speech [1]. This paper describes several recent enhancements in CU VOCAL. First, we have augmented the syllable unit selection strategy with a positional feature. This feature specifies the relative location of a syllable in a sentence an...
متن کاملBuilding Text-To-Speech Voices in the Cloud
The AT&T VOICEBUILDER provides a new tool to researchers and practitioners who want to have their voices synthesized by a high-quality commercial-grade text-to-speech system without the need to install, configure, or manage speech processing software and equipment. It is implemented as a web service on the AT&T Speech Mashup Portal. The system records and validates users’ utterances, processes ...
متن کاملAT&T VoiceBuilder: A Cloud-Based Text-to-Speech Voice Builder Tool
The AT&T VOICEBUILDER provides a new tool to researchers and practitioners who want to have their voices synthesized by a high–quality, commercial–grade text-to-speech (TTS) system without the need to install, configure, or manage speech processing software and equipment. It is implemented as a web service on the AT&T Speech Mashup Portal. The proposed system records, processes, and validates u...
متن کاملConsiderations in the usage of text to speech (TTS) in the creation of natural sounding voice enabled web systems
The voice enabled web is a combination of XML based markup languages, speech recognition, text to speech (TTS) and web technologies. Key to the success of voice enabled web applications is the “naturalness” of the interface. Users are much more likely to interact with a system they feel comfortable with and that responds in a human like way. This paper describes the deployment of TTS in commerc...
متن کاملStrategies for Enterprise Voice Enabled Web Projects
The voice enabled web is a combination of XML based markup languages, speech recognition, text to speech (TTS) and web technologies. Key to the success of voice enabled web applications is the “naturalness” of the interface. Users are much more likely to interact with a system they feel comfortable with and that responds in a human like way. This paper describes the deployment of TTS in commerc...
متن کامل